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Abstract 

Our recent formulation of the analytic and variational Slater-Roothaan (SR) method, which 
uses Gaussian basis sets to variationally express the molecular orbitals, electron density and the 
one body effective potential of density functional theory, is reviewed. Variational fitting can be 
extended to the resolution of identity method, where variationality then refers to the error in 
each two electron integral and not to the total energy. However, a Taylor series analysis shows 
that all analytic ab initio energies calculated with variational fits to two-electron integrals are 
stationary. It is proposed that the appropriate fitting functions be charge neutral and that all ab 
initio energies be evaluated using two-center fits of the two-electron integrals. The SR method has 
its root in the Slater's Xa method and permits an arbitrary scaling of the Slater-Gaspar-Kohn- 
Sham exchange-correlation potential around each atom in the system. The scaling factors are the 
Slater's exchange parameters a. Of several ways of choosing these parameters, two most obvious 
are the Hartree-Fock (HF) ajjF values and the exact atomic uea values. The former are obtained 
by equating the self-consistent Xa energy and the HF energies, while the latter set reproduce 
exact atomic energies. In this work, we examine the performance of the SR method for predicting 
atomization energies, bond distances, and ionization potentials using the two sets of a parameters. 
The atomization energies are calculated for the extended G2 set of 148 molecules for different basis 
set combinations. The mean error (ME) and mean absolute error (MAE) in atomization energies 
are about 25 and 33 kcal/mol, respectively for the exact atomic, oiea, values. The HF values of 
exchange parameters, auF , give somewhat better performance for the atomization energies with 
ME and MAE being about 15 and 26 kcal/mol, respectively. While both sets give performance 
better than the local density approximation or the HF theory, the errors in atomization energy 
are larger than the target chemical accuracy. To further improve the performance of the SR 
method for atomization energies, a new set of a values is determined by minimizing the MAE 
in atomization energies of 148 molecules. This new set gives atomization energies half as large 
(MAE ~ 14.5 kcal/mol) and that are slightly better than those obtained by one of the most widely 
used generalized gradient approximations. Further improvements in atomization energies require 
going beyond Slater's element-dependent functional form for exchange employed in this work to 
allow exchange-correlation interactions between electrons of different spin. The MAE in ionization 
potentials of 49 atoms and molecules is about 0.5 eV and that in bond distances of 27 molecules 
is about 0.02 A. The overall good performance of the computationally efficient SR method using 
any reasonable set of a values makes it a promising method for study of large systems. 
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Theoretical methods for electronic structure calculations in practice today can be broadly 
classified into two categories. From the perspective of density functional theory (DFT), these 
two classes of models differ from each other only in the way the unknown exact exchange- 
correlation (XC) energy functional is approximated. The first class of methods are the 
traditional quantum chemical methods such as Hartree-Fock (HF) theory and beyond []|. 
These methods are generally implemented using Gaussian basis sets. The molecular orbitals 
are analytically expressed as linear combinations of atomic orbitals (LCAO). The atomic 
orbitals are contracted sets of primitive Gaussian basis functions. One advantage of the 
Gaussian basis set is that it permits computation of matrix elements of the energy and 
a number of other properties completely analytically, which allow them to be computed to 
whatever precision is desired. Another advantage of Gaussians is that they are very localized 
in real and momentum space resulting in sparse matrices. The primary disadvantage of 
Gaussian-based HF is that it formally scales as N A , where N is the number of atomic orbital 
basis functions. Despite that single disadvantage HF is often the choice of users for geometry- 
optimization of molecules because usually some combination of basis-set choice, cutoff and 
other approximations define HF calculations that are the fastest all-electron calculations 
using current, commercial, quantum chemistry computer packages. HF-based methods can 
alway be systematically improved towards the exact results. 

Density functional (DF) based models form the second class of methods for studying 
the ground-state properties of materials. The most important aspect of density functional 
theory is relative computational simplicity which leads to better scaling and the ability to 
optimize the functional form of the Kohn-Sham (KS) multiplicative potential. Today, KS 
density functional models are the predominant choice for pure electronic structure calcula- 
tions. DFT formally scales as N 3 . Various implementations of the DF models that use a 
variety of different type basis sets as well as those that are fully numerical (use no basis 
sets) have been reported. Most of these implementations including those that use analytic 
Gaussian basis sets require use of numerical mesh to compute the contribution from the 
XC term. The complexity of the functional forms, particularly for all-electron calculations 
using the gradient-corrected functionals can require a very sophisticated numerical grid of 
points to integrate the XC energy density to get the total energy accurately. The use of 
grids makes calculations of matrix elements that are accurate to arbitrary precision (as in 
the Hartree-Fock method) practically impossible and leads to number of problems such as 
steps or kinks in the potential energy surface, spurious negative frequencies, etc. Also, a nu- 
merical total energy incorrectly depends on how the molecule is oriented with respect to the 
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orientation of the numerical grid. Consequently, the total energy is not exactly rotationally 
invariant . Some of these problems can be eliminated by using very refined numerical 
grids which, needless to say, leads to reduction in computational speed that compromises 
the inherent efficiency of the DF models. DF implementations that are fully analytic are 
obviously desirable. 

While translating the Slater-type-orbital DF method of Baerends, Ellis and Ros [6( for 
Gaussian-type-orbitals, Sambe and Felton |3| proposed treating the KS equation completely 
analytically, which was a significant first step towards analytic DFT, however a numerical 
grid was needed to fit the XC potential. This works quite well for the subset of DF theory in 
which the same fits can be used in the KS equations and the energy thereby almost preserving 
the variational principle; i.e. preserving the variation principle when a complete basis set is 
used to fit the XC potential ^|. These efforts greatly improved upon the muffin-tin (MT) 
approach to DFT, which spherically averaged the KS potential introducing a discontinuity 
in the potential at the surface of the spheres. Direct integration of the potential in the 
MT approximation leads to an undefinable energy. Instead, as is common even today with 
diverse formulations of DFT, one simply makes the same approximations in evaluating the 
energy that one mak es in co m pn ti n g the potential BQflQ. In snch cases today one 
argues, where possible, that the energy and the KS potential are computed exactly, therefore 
the calculation is variational. In this work the calculation of the energy and potential is 
inexact due to incomplete basis sets, but the calculations are to machine-precision explicitly 
variational. 

Our variational solution 8] to the problem of fitting any charge distribution has become 
quite popular, particularly for simplifying MP2. While adding second derivatives [3| to a 
descendant of our code, DGauss a different fit was used to simplify the MP2 energy 
expression. That new fit was called Resolution of the Identity (RI)[lq. In a later application 
of their RI method, one of the definers of RI showed empirically that our fitting method 
was better than the original approach \v\. Many have attributed this improvement to some 
unknown 0, 18] or not clearly relevant 3] properties of the Coulomb norm and have 



told us that others 



21] used the Coulomb norm before our work. Our work concerns 



variational fitting, whereas RI, which if it is not precisely our method, requires a complete 
basis set. 

The fits used in the later RI work are precisely the variational solutions that maximize 
the unique robust (no first-order error due to fitting) Coulomb self-energy of the fitted 
charge distributions. Furthermore, with these fits, the RI energy calculated using three- 
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center approximations to the two-electron integrals is identical to the same expression using 
the two-center approximations to the two-electron integrals that are obtained using fits to 
both charge distributions 22j. In the following, we examine a Taylor-series expansion in the 
error made in fitting two-electron integrals in all ab initio energies. We define good (but 
not perfect) fits to have negligible quadratic and higher errors. All XC-related, robust and 
variational fits used in this work do not involve the Coulomb norm. 

Chemists are beginning to say that our method does not work in RI applications to large 
systems (23I . I24I l^ . The physics problem is that an unbalanced charge, no matter how 
small or how far separated, when arranged on a infinite, periodic lattice has an infinite 
Coulomb self-energy. A way to handle this fact is to constrain the fits to have the right 
charge jfj 13, 21, S] If this is done, then by Gauss' law the electrostatic potentials of the 
fitted and exact charge distributions become identical as soon as they possibly can, outside 
the two distributions. Our code uses both constrained and unconstrained fits. The fits are 
robust, and if they are good (but not perfect) then the constrained and unconstrained energy 
are the same within typical quantum chemical tolerances. 

This work also uses robust and variational fits of the XC potential. A set of DFT models 
that have the same functional form in the energy density and the KS potentials, 



E xc[p] = ^J p(r)v x [p(r),r\d 3 r, 



^[p] = -4© V V /3 (r), (1) 



with 



where a is the Slater exchange parameter (which was found to be 1 by Slater [29] and 2/3 by 
Gasparj^l and Kohn and Sham P] (GKS)), is easiest to treat analytically. The difference 
in the two values of a has roots in the averaging process employed in the simplification of 
the HF exchange potential. Slater obtained a = 1 while simplifying the nonlocal exchange 
potential of the Hartree-Fock approximation by averaging exchange potential over the entire 
Fermi sphere of radius kf = (37r 2 p(f)). The GKS value of 2/3 is obtained by applying the 
variational method to the statistical total energy expression, and only uses an average over 
surface of the Fermi sphere k = kf(r) 1 ^ 3 . Later, Kohn and Sham also set a to be 2/3 for 
general energy functionals through the construct of the non-interacting electron gas. Most 
DF models that are in practice today use the GKS value of a. Recently, overall improvement 



in performance of some of these DF models has been noted if a is allowed to vary|3Jj, 

In the Xa method, the form of the exchange potential is given by Eq. [1] and the a are the 
scaling parameters. The Xa method is the outcome of simplifying the Hartree-Fock method. 
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It was suggested that the scaling parameters a could be obtained by ensuring that the self- 
consistent Xa energy for atoms matches the Hartree-Fock energy. Such set of a parameters 

Subsequently, several other ways to determine the a 



was determined by Schwarz 
parameters were proposed 
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The muffin-tin 



implementation of the Xa method allowed the use of atom-dependent a values. The unique 
advantage of this quantum-chemical method was that the model smoothly dissociates into 
atoms as it was pulled apart, independent of the basis-set (partial waves in the muffin- 
tin method). With the ability to choose the atomic energies however one wanted, it is 
a way to extrapolate atomic properties to all homogeneous and heterogeneous materials, 
or alternatively the properties of all elemental molecules or crystals to all heterogeneous 
materials. 

A fully analytic method that allows arbitrary scaling of exchange potential around each 
type of atom in the heteroatomic system was recently formulated and implemented 0]. It is 
called Slater-Roothaan (SR) method. It is free from the problematic energy of the multiple 
scattering Xa model but retains all its advantages. It is computational very efficient and has 
been successfully a ppli ed to study boron and aluminum nitride nanotubes contains about 



two hundred atoms 
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49]. 



The accuracy of the SR method ultimately depends on the choice of scaling parameters, 
a's, used in calculations. Two obvious choices for choosing a are: (1) the a =olhf values 
that give the HF energy for atoms, and (2) the a =oiea values that give the exact atomic 
energies. The use of later set is appealing because when used in molecular calculations, 
as the molecule dissociates the corresponding sum of the exact atomic energies can be 
obtained. Our early calculations on the total energies of molecules shows that remarkably 
accurate total molecular energies can be obtained using the second set [fit]]. These molecular 
energies of the G2 set of 56 molecules are better than or comparable to most pure or hybrid 
density functional models (such as PBE51], B3LYP[52f etc, see Ref. for details). In 
the present work, we examine the overall performance of the Slater-Roothaan method for 
the atomization energies, bond distances, and ionization potentials when the above sets 
of a values are used. We use the extended G2 set of 148 molecules for the benchmark. 
This set is routinely used to examine the performance of various density functional and 
related models. As we shall see, the performance for atomization energies with these two 
combinations of Gaussian basis sets, though better than the local density approximation or 
the Hartree-Fock method, is inferior to that of the DFT models with generalized-gradient 
approximations (GGA). Many density functional models are parametrized to give better 
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atomization energy. Following this practice, we explore the possibility of improving the 
performance of Slater-Roothaan method for atomization energies. This is accomplished by 
determining a new set of a values that minimize the mean absolute error (MAE) in the 
atomization energies of 148 molecules. This parametrization gives accuracy comparable to 
some of widely used gradient-corrected density functional models. 

I. THEORETICAL METHOD 

A. Variation and the Eigenvalue Problem 

Our interest is in the development of analytic DFT through robust and variational fitting. 
The total electronic energy in Hohenberg-Kohn-Sham P, |J DFT for an A^-electron system 
is a functional of electron density p. The electron density is given by 

N 



where, 4>i } a(r) are single particle orbitals, and n it(7 , the occupation numbers for both spins. 
The total electron density p is the sum of spin densities pf and p^. In chemistry, the orbitals 
are still usually expressed in LCAO form, 



j 

The energy is determined by constrained variation of the energy with respect to orthonormal 
orbitals, 



where the angular braces indicate the overlap integral and the e matrix is all the Lagrange 
multipliers needed to allow the orbitals to be made orthonormal while minimizing the energy. 
In DFT a unitary transformation can always be found to diagonalize this matrix to generate 
a conventional molecular-orbital eigenvalue problem. This eigenvalue problem is called the 
Kohn-Sham (KS) equations. We approximate these equations by fitting in uniques ways to 
make them less computationally challenging while preserving the full variational principle. 
The first robust and variational fit was to the self-Coulomb energy of a charge distribution, 




(2) 




(3) 




(4) 





S 



This energy is approximated by expressing the charge density as a fit to a set of Gaussian 
functions, 

i 

where, p(f) is the fitted density, dj is the expansion coefficient of the charge density Gaussian 
basis-function Gj. The elimination of the first order error in the total energy due to the fit 
leads to the unique robust expression for the self-Coulomb energyjjj 

E ee ~We = 2(p\\p)-(p\\p) = (p\\p)+ (Ap\\p), (7) 

where, Ap = p — ~p. Unconstrained variation of this energy gives 

d=(G\\G}- 1 -(G\\p). (8) 

In this work, an overline represents any approximation, good, bad or exact. Thus the identity 
is not necessarily resolved, and we do not use the RI method [3, 12 1. Hopefully we and 
others can generate basis sets that deliver good fits. 

In any event we single out fitted energies that contain no first-order error and call them 
robust, because mathematically speaking, it makes no sense to include non- robust, approx- 
imate energies in any eigenvalue problem. The fit, if it is good, will change if the original 
charge distribution does; the derivatives of good fitting coefficients with respect to LCAO 
orbital coefficients are not zero. In the derivative of the total energy, however, this derivative 
is multiplied by zero if the fit is obtained by variation of a robust energy. It is easy to obtain 
atomic density-fitting bases for which the first-order error is small and approximately equal 
to the square root of the total (quadratic) error made in this approximation j^. (Of course, 
variational fits only have a first-order error if they are constrained.) 

With the advent of fitting basis sets that are individually optimized for corresponding 



orbital basis sets 
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551 ]. it is appropriate to define a stronger definition of good fit. If we can 
use basis sets that are accurate to one part in 10~ 5 then a robust fitted energy is accurate to 
is about one part in 10 -10 , which is a appropriate number to be used for molecular integral 
cutoffs, defining convergence, etc., in ah initio quantum chemistry codes and can probably 
be safely neglected in most of quantum chemistry. We call a fit this good or better a good 
fit. Such a fit is only a good fit, we do not claim it to be exact. This is our target, which at 
this point we cannot guarantee. 

One could, of course, use a non-robust expression for the fitted energy and then correct 
the effects of that error on the orbital variational problem by taking the total derivative of 
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the energy, as written in Eq. 3. In that case the corresponding Fock matrix or KS equations 
differ from what would be obtained by inserting the non-robust fit where appropriate in 
the fit-free eigenvalue or KS equations. The potential of the eigenvalue problem cannot be 
made independent of the energy. If the dependence of the energy on the orbitals is not 



treated variationally, then the forces are not accurate 56]. This fundamental problem prob- 
ably means that the proposal of Sambe and Felton []| to fit the exchange-correlation (XC) 
potential of DFT to LCAO form by numerical sampling (or direct numerical integration) 
must be abandoned. Now almost all DFT codes treat the XC potential by direct numer- 
ical integration, despite the fact that exact forces arising from using an auxiliary basis to 
numerically fit the XC potential are known and have been implemented precisely for 
gradient corrected functionals 58\. The exact partitioning scheme of Becke 0] together 
with adaptive grids that vary with the orbital basis set 0] are now overwhelmingly popular 
with LCAO approaches to DFT 0]. Intermediate in the trade-off between accuracy and 
efficiency between the Sambe- Felton and Becke approaches is one in which the variationally- 
fitted density (Eq. © ) replaces the two-center exact density in the numerical construction 
of the XC matrix elements 

13- 



B. Robust Fitting and the Resolution of the Identity 

The post-HF ab initio energies are functions of the two-electron integrals 

{ct>4M k ct>i) = mkl). (9) 

RI energies are defined to be functions of (ij\\kl), where kl is the fit to the kl orbital pair. 
The post-HF ab initio energies can be expanded in a Taylor series in the difference between 
between an exact two-electron energy and any approximate two-electron energy, 

E{{ij\\U)) ~ E{W\Mj) = J2 n\WM^r^ iiW ^ 

where 

A ijkl =(ij\\ki)-mki). (ii) 

The unique robust two-electron energy 

(ij\\kl) « WPU = (u'P> + GJ||M> - (yp>, (12) 

where both fits are separately determined by Eq. (|HJ), is beginning to be used directly in ab 
initio quantum chemistry |63j]. No matter how both fits are obtained there is no first-order 
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error, 

(ij\\kl)-WM)ro b =( A ii\\ A kl). (13) 

If Eq. (|SJ) is used then all approximations are exactly equal 

(ij\\kl) = (ij\\kl) = (ij\\kl), (14) 

and the RI and robust (Eq. ()12Jl) approximations to the two electron integral are identical. 

The Coulomb norm is not magical. Coulomb potentials from charge distributions can 
be approximated robustly by fitting the potential due to a charge distribution rather than 
fitting the charge distribution itself j&J. Obtaining good basis sets, however, is more of a 
problem than in the charge distribution fitting case |65j, which is being overcome For 
ab initio energies, 

Emm - E(mkT) rob ) = E^^^( a -ii a ^- ( 15 ) 

If both fits are good, then this equation practically equals zero, i.e, (Ay||A^), being 
quadratic in errors, is expected to be accurate to one part in 10 10 . Thus, for good RI 
fits, the fitted energy is practically identical to the original energy 



E((ij\\kl)) ~ E((ij\\kl) rob ) = E((ij\\kl)) = E((ij\\kl)) = E((ij\\kl)), (16) 

where the last three equalities hold only if the fits are obtained using Eq. (jSJ). 

These simplest, global fits are problematical for large clusters |23j, |2J|- They could be 
constrained, in which the case the robust, rather than either of these two approximate, two- 
electron energies that involve a single fit must be used to obtain robust energies, independent 
of whether or not the Coulomb metric is used. It might be better to consider changing the 
fitting basis. The amount of charge in the product of two molecular orbitals is either zero or 
one depending on whether or not the orbitals are the same. If the orbitals are the same, then 
our methods are sufficient 0, 0|. If the orbitals differ, as is the case in MP2, then all fitting 
basis functions should contain no charge, and a similar p RI basis for atoms corresponding 
to our s-type density-fitting basis, would include all the atomic p exponents in the orbital 
basis, but rather than doubled simply incremented by the smallest s exponent. This picks 
up the smallest p-type contribution possible from the product of primitive p and s orbital 
functions and this basis has a flexibility equal to that of the p orbital basis. For most basis 
sets this is smaller than the correspond s-type charge-density fitting basis. Since the basis 
is neutral and the product of different orbitals pairs to be fit is neutral, then all interactions 
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die off as fast as possible by Gauss' law. All non-s fitting functions have no charge, therefore 
the L > 1 charge- density fitting functions are probably a good basis sets for fitting both the 
diagonal and non diagonal products of two molecular orbitals. This is obviously a better 
basis than any currently used for studies of large systems. If the RI fitting basis behaves 
well asymptotically, then the Coulomb norm is likely to be proved best again for standard 
RI approximations to the MP2 energy of infinite systems [^j]. A zero-charge basis will 
likely to be effective in density functional perturbation theory as well, if the resolution of 



the identity 68, 69] or ideally robust and variational fitting is used. With such a basis we 
envision keeping constrained fits as a simple option to test the stability of the calculation 
even for large systems. 



C. Variational Fitting 

We take the full variation of the energy with respect to the orbitals before solving the 
eigenvalue problem in order to obtain precise forces. If we modify the energy by adding any 
number of robust fits, 

f[a t {r)} = mi?)} + Order [(a*-^) 2 ], (17) 
ai(r) = ^ciijA^r), (18) 

ij 

where Ai(f) is an appropriate LCAO basis for the the i th fit, to approximate troublesome 
terms in the energy, then the Fock matrix for the corresponding eigenvalue equation is 
obtained by the chain rule of differentiation, 

dE dE da k i ^ 

dCij ~f da^i dCij 

If robust fits are available, then we can improve upon them by obtaining the fits through 
a variation of our robust energy, exactly as orbitals are obtained. If and only if the fits are 
variational can the same fits be present in both the energy and the corresponding eigenvalue 
problem for the orbitals. 
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D. Analytic formulation of the Gaspar-Kohn-Sham-Slater density functional 
model 



The KS energy is given by 



N 



E KS [p] =J2 < > + E *c[PhPl] 



(20) 



where, the first term contains the kinetic energy operator and the nuclear attractive potential 
due to the M nuclei, 

V72 M 7 

a = -t-Ef%t- (21) 

The second term in Eq. (f2T)|) represents the classical Coulomb interaction energy of electrons 
discussed above. It is approximated robustly through Eqs. (J7J) and (JBJ). The last term E xc 
in Eq. (J27?j) is the exchange energy, 



W 3 f*' ' 



(22) 



The form of above functional allows analytic calculations with the Gaussian basis to be 
performed. For this purpose the one-third and two-third powers of the electron density are 
expanded in Gaussian basis sets: 



ps(f) « pa = ^dEi 



(23) 
(24) 



Here, {Ei} and {Fi} are independent Gaussian basis functions , w hile and fi are expansion 
coefficients. The exchange energy is then given 



F X c C a 



where C a = —9a ( | J . Thus using the four LCGO basis sets (one for orbital expansion 
and three fitting basis sets) the total energy is calculated analytically. The LCAO orbital 
coefficients and the vectors d, e, and f are found by constrained variation. 



4, _I, 2 1 _i _2 1 2 a. 

g(PP 3 )- g(P 3 P 3 P 3 ) + gW 3 ) 



(25) 
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E. Slater-Roothaan method 



The expression for the total electronic energy in the Slater-Roothaan method has the 
following form: P| 



E SR = < <f>i\h\<t>i > + 2 (p\\p) ~ (p\\p) 

i 

- ^ p4 i 2 i i 2 
- 2^ C x - (gn gi) + - (g\ g\ 9\) 



+ ah) ■ (26) 

Here, C x = C a /a; the partitioned 3/4 power of the exchange energy density, 



) = 5>(«>0")fl&tf), (27) 



where D^{r) is the diagonal part of the spin density matrix, and the function, 

a(i) = a 3 / 8 (28) 



contains the Oj, the a in the Xa for the atom on which the atomic orbital i is the centered. 
The fits to powers of g a corresponding to Eq. 23-l2"4l are obtained variationally from Eq. 



II. COMPUTATIONAL DETAILS 

The analytic SR method requires four Gaussian basis sets. One for the orbital expansion 
and others to fit different powers of electron density, which we obtain from literature. We 



l density, 

?0,Q 



choose Pople's triple-C (TZ) 6-31 1G** basis (75|, |76j and the D Gauss |15j| valence double-^ 
(DZ) basis setQ called DZVP2 for orbitals basis sets. The s-type fitting bases are obtained 
by scaling and uncontracting the s part of the orbital basis. The scaling factors are 2 for 



2 



the density, | for p5 and | for p3 . These scaled bases are used for all s-type fitting bases. 
Ahlrichs' group has generated a RI-J basis for fitting the charge density of a valence triple-^ 
orbital basis set used in the Turbomole program [55(. The non-s parts of Ahlrich's fitting 
bases are used in combination with the 6-31 1G** orbital basis sets. Hereafter, we shall 
refer to this combination of basis sets as basis set I or 6-311G**/RIJ . In combination with 
DZVP2 orbital basis, we use the pd part of the A2 charge density fitting basis. This will be 
referred to as basis set II or DZVP2/A2. The geometries of molecules were optimized using 
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the Broyden-Fletcher-Goldfarb-Shanno (BFGS) algorithm 0, H IZS, 



80, 811. The forces 



on atoms are rapidly computed non-recursively using the 4-j generalized Gaunt coefficients 
0, 3- The atomic energies are obtained in the highest symmetry for which the self- 
consistent solutions have integral occupation numbers. The atomization energy is computed 
from the total energy difference of optimized molecule and its constituent atoms. The open 
shell atoms are treated in lower than spherical symmetry so that they could have integrals 
occupation numbers. Thus, following point group symmetries were chosen: C2 V for C, O, Si 
and S atoms, for atoms F, Al, and CI, D 3 h for B, and Ih for Li, N, Na and P. 

A database of geometries of set of 148 molecules, known in literature as the extended 
G2 set, was built. The molecules were built with appropriate symmetries to expedite their 
structure optimization. A new set of a values was optimized by minimizing the mean 
absolute error in the atomization energies of the 148 molecules: 

148 



{oiopt} 



mm 



E 



\Dl({a,}) - Dj(exact)\ 
148 



(29) 



where, D l e ({aj}) is the atomization energy of the i th molecule with j th set of alpha values. 
The {aj} optimization was repeated by starting with two different sets of alpha values: olhf 
and oiea- We used the simplex method jsi| and during each iteration of {c^} optimization 
procedure, all 148 molecules were reoptimized. The {a:j} optimization process all together 
involved about 10000 optimization of molecules, which was accomplished by the use of 
PERL scripts. The final optimal set gives the best atomization energies for the simple 
element dependent Slater's exchange functional employed in the present model. It is called 
aopT hereafter. 



III. RESULTS AND DISCUSSION 
A. Atomization energies 

The two sets of a values used calculations in these calculations are given in Table |U The 
last column in this table is a new set of a values, which as noted earlier, is parametrized 
to give the best atomization energies. The computed atomization energies for the G2 set 
of 148 molecules are given in Table |HJ The point group symmetries of molecules are also 
listed in the Table |H] The last two rows in Table |H] give the mean absolute error (MAE) 
and the mean error (ME) in the atomization energies of the 148 molecules. It is apparent 
from the MAE and ME that the basis sets effects are marginal. The errors with smaller 
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basis DZVP2/A2 are of similar magnitude to those obtained by the larger 6-311G**/RIJ 
basis set. For the exact atomic alpha, oea, values the MAE is about 34 kcal/mol which is 
larger than that for the atomic Hartree-Fock, olhf , values (~ 26 kcal.mol). In the former 
case, the atomic energies by construction have zero error. In the case of olhf , the atomic 
energies are equal to the Hartree-Fock energies. So error in atomic energies is roughly the 
correlation energies of these atoms 84]. As the olea values give exact total energies of atoms, 
it was expected that their use in molecular calculations will result in overall improvement 
in molecular properties. This expectation is not unreasonable as atomic energies are exact 
by construction and the molecules in the dissociation limit would give sum of exact atomic 
energies. It is therefore surprising that the cxhf set performs better than the oiea set. The 
better performance for the former set is clearly a consequence of error cancellation of total 
energies of molecules and atoms. On the other hand, the MAE in atomization energies is the 
same as error in total energies when oea values of used. No error cancellation occurs in this 
case. Note that performance of many density functional models for atomization of energies 
is due to such cancellations of errors in total atomic and molecular energies j^j]. Many of 
these sophisticated functionals perform poorly for the total energies. The ME reflect that 
use of ole a and o-hf values lead to an overall overestimation of the atomization energies. 
The largest deviation of about 146 kcal/mol occurs for the C2F4 molecule. There are about 
6 molecules with absolute errors larger than 100 kcal/mol. These are CF4, CF3CN, Cef^, 
C4H4O, C5H5N, and C 2 F 4 . 

Optimizing the SR method for atomization energies significantly improves its performance 
as can be seen from the Table |HJ The MAE and ME in atomization energies are reduced 
to 14.5 kcal/mol and -4.5 kcal/mol. The optimization process varies most the a values of 
atoms on the left of periodic table and those on the right side. The a values of N, O, 
and F are reduced with respect to the Qhf and olea values. The maximum absolute error 
is also reduced significantly. Overall performance of olopt set for atomization energies is 
comparable to that of the Perdew-Burke-Ernzerhof [5]]] generalized gradient approximation 
(PBE-GGA)f 



B. Bond distances 



The bond distances of 27 selected diatomic molecules |8 71] are compared in Table ITTT1 with 
their exact counterparts. The MAE and ME in bond distances are in the last two rows of 
the same table. The maximum MAE is about 0.02 A. Except for the larger basis with otHF 
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the ME are negative. Unlike LDA or GGA, there is no consistent trend of overestimation 
or underestimation. On the whole the bond distances are smaller than the experimen- 
tal bond distances. This trend is different from that of the local density approximation 
(Slater exchange + Vosko-Wilk-Nusair correlation, called sometimes as SVWN) and GGA 
functionalsj^], 0] which either overestimate or underestimate bond distances. The slight 
underestimation of bond distances, on average seems consistent with the on the whole over- 
estimation of atomization energies. In particular, the olea bond distances are shorter and 
the atomization energies higher. The present value of MAE (Table 11111 can be reduced if 
molecules containing Li and Na are omitted. The largest deviation of 0.09 A occurs for 
the Na2- Also, F2 is another molecule which is difficult for the present models. Indeed, our 
study on these diatomics show that even a very small value of a leads to binding of the 



F 2 molecule 



431 ] . Overall the bond distances are fairly accurately predicted for both the a 



values are used, with o>hf values giving somewhat better performance. The bond distances 
with new otopT set are not given in Table ITTT1 Using this set, the MAE in bond distances of 
27 molecules increases to 0.04 A. 



C. Ionization potentials 



The ionization potentials of 49 atoms and molecules [8 9j computed for the oiuf and olea 
values are compared with their experimental counterparts in Table IIVI In computing the 
ionization potential (IP) the energy of the cation was calculated at the geometry of the 
neutral molecule. Also, the symmetry of the neutral system was assumed for the cation. 
This choice of convenience will tend to exaggerate the errors in IP for some systems. For 
example, the IP of nitrogen atom and methane molecule, are overestimated. For N atom, 
broken-symmetry calculation will give lower energy for cation. Similarly, Jahn- Teller dis- 
tortion will lower the energy of methane cation and will reduce the error in the (adiabatic) 
IP. The MAE in IP for two sets of a values are about 0.5 eV. Both, MAE and ME are 
of roughly similar magnitude for both basis sets combinations indicating that the basis set 
effects are negligible. The sign of deviations do not show any consistent trend of overes- 
timation or underestimation. The electron affinities could be similarly computed from the 
energy difference of the neutral molecule and its anion. However, we did not compute the 
electron affinities as the computation of electron affinities within the most density functional 
models (including the present one) is problematic as these quantities are sensitive to the self- 
interaction of electrons. The presence of self-interaction error leads to exponential decay of 
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the effective potential in the asymptotic region as opposed to the correct — 1/r asymptote. 
The additional electron in the anion therefore experiences shallower potential than it other- 
wise should. This sometimes leads to the positive eigenvalue for this electron although the 
total energy may be lowered. For this reason we did not compute electron affinities but we 
expect that accuracy of the present method for electron affinity will be similar to that of 
the local density approximation. 



IV. SUMMARY 



Analytic quantum mechanics using Gaussian basis sets has been extended. RI ab-initio 
energies evaluated using two electron integrals in which both charge distributions are vari- 
ationally fitted are themselves variational with respect to the fitting parameters. The per- 
formance of fully analytic density functional model called the Slater-Roothaan method, is 
examined for various properties, using two different sets of a values, for two different basis 
sets. The comparison of two sets of a values used in this work show that a values obtained 
by matching atomic Hartree-Fock energies (a^ ) give better performance that those ob- 
tained by equating atomic energy to the exact atomic energies (chhf )• The basis set effects 
as judged from the comparison of results obtained with two sets of bases show that smaller 
DZVP2/A2 basis provides accuracy comparable to the larger 6-311G**/RIJ basis. The 
MAE in atomization energies of the SR method with auF and olea sets are about 26 and 34 
kcal/mol, respectively. In comparison with the local density approximation or the Hartree- 
Fock method, the SR method with oluf an d o.ea sets performs quite well for atomization 
energies, bond distances and ionization potentials. Further improvement in performance of 
the Slater-Roothan method for atomization energies is obtained by parameterizing it for 
atomization energies. A new set of a parameters (aopr ), is determined by minimizing the 
MAE in atomization energies of a set of 148 molecules. With this set of olopt parameters, the 
mean and mean absolute errors in atomization energies of 148 molecules are -4.5 kcal/mol 
and 14.5 kcal/mol. This makes its performance for atomization energies comparable to that 
of the PBE-GGa|86|. This is remarkable considering it is perhaps the simplest of the den- 
sity functional models and it is certainly the oldest such model. The hybrid and even more 
complex functionals like B3LYP0 or the PBEO0 perform better than the PBE-GGA. To 
further improve the performance of the SR method for the atomization energy it is necessary 
to go beyond the simple element dependent Slater's exchange functional form employed in 
this work. Considering the overall performance of SR method, it is a good starting point 
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for optimization of large systems. Many calculations employ Hartree-Fock method to op- 
timize geometries or to explore potential energy surfaces, and then compute properties of 
interest at a more sophisticated level. The analytic SR model scales better than the HF 
method and being analytic it is necessarily computationally efficient. One could use the a 
value used in this work to optimize structures and then perform single point calculations, 
if feasible, using more sophisticated models. The ability of density functional models to 
perform large scale calculations is well known. Of the plethora of density functional models 
of different accuracy, choosing the simpler ones like the local density approximation further 
enhances one's ability to study large systems and/or to perform longer molecular dynamics 
simulations. Indeed, several recent large scale simulations are performed using local density 
approximation 0, 0, 0|. In a recent work, the local density approximation (SVWN) was 
shown to be about 55% faster than the B3LYP and 40% faster than the BLYP for a calcu- 
lation on 9-alanine system j^]. The analytic implementation like the present one, being free 
from grids, will perform even better and is a promising alternative for large scale calculations 
at modest accuracy. 

The Office of Naval Research, directly and through the Naval Research Laboratory, and 
the Department of Defense's High Performance Computing Modernization Program, through 
the Common High Performance Computing Software Support Initiative Project MBD-5, 
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TABLE I: The set of a values used in the present work. The olhf values are due to Schwarz 
[3| (spin-polarized by Connolly 



111 ]) and the oea values are from Ref. |43|. The cxopt values in 
the last column are obtained by minimizing the mean absolute error in atomization energy of 148 
molecules (see text for more details). 



Molecule 


OL U T? 
^ri r 


Q£ A (6-311G**/RIJ) 


apM(DZVP2/A2) 


OV~) DT 
^(J r 1 


H 


0.77627 


777390 


781240 


753703 


Li 


0.77157 


0.791690 


0.792110 


0.839295 


Be 


76823 


795740 


796140 


557058 

V / . '.7 * 7 1 V 1 ".7 l_J 


B 


76206 


786750 

v7 • 1 WW 1 ^J\J 


786770 


674430 

v7 • V7 1 a **J \J 


C 


0.75331 


0.776770 


0.776650 


0.675039 


N 


0.74522 


767470 


767260 


645482 

V7 . V7 -L '.7 J- v7 


o 


0.74188 


765000 


764480 


657869 

V7 • V7 v7 1 V. 7 V7 *7 


F 


0.73587 


0.760660 


0.760010 


0.575118 


Na 


0.73115 


0.752040 


0.752870 


0.823779 


Mg 


0.72918 


0.749940 


0.751200 


0.768385 


Al 


0.72853 


0.748220 


0.748690 


0.743690 


Si 


0.72751 


0.745390 


0.746020 


0.767716 


P 


0.72620 


0.743240 


0.743970 


0.860982 


S 


0.72475 


0.742620 


0.743500 


0.743745 


CI 


0.72325 


0.741970 


0.742720 


0.662447 
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TABLE II: The deviation in atomization energies for the 
set of 148 (extended G2 set) of molecules within different 
computational models- Ml: SR-a S A/6311G**/RIJ. M2: SR- 
OEA/DGDZVP2/A2, M3: SR-aj/ F /6311G**/RIJ, M4: SR- 
a HF /DGDZVP2/A2, M5: SR-a 0p t/DGDZVP2/A2, All en- 
ergies are in kcal/mol and are calculated at the optimized 
geometries of molecules in the respective model. The exact 
atomization energies are from Ref. 



Molecule 


Symmetry 


Ml 


M2 


M3 


M4 


M5 


Exact 


H 2 


Deft 


-24.9 


-22.7 


-24.9 


-22.8 


-23.4 


110.0 


LiH 


Cq v 


-19.6 


-23.9 


-20.1 


-24.4 


-19.6 


57.7 


BeH 


Cq v 


8.6 


-16.9 


6.3 


-18.5 


1.7 


49.6 


CH 




-16.7 


-15.5 


-17.1 


-16.2 


-16.2 


83.7 


CR 2 ( 3 B 1 ) 


c 2l , 


5.5 


7.2 


1.8 


2.6 


-7.4 


189.8 


CH^Ai) 


c 2t) 


-22.4 


-19.1 


-24.3 


-21.7 


-25.6 


180.5 


CH 3 


D 3 / t 


-7.2 


-4.0 


-11.5 


-9.4 


-19.5 


306.4 


CH 4 


Td 


-11.8 


-7.3 


-16.9 


-13.9 


-23.4 


419.1 


NH 




-16.3 


-14.6 


-16.7 


-15.4 


-15.1 


83.4 


NH 2 


c 2t) 


-24.4 


-20.0 


-25.5 


-22.0 


-22.9 


181.5 


NH 3 


C 3t , 


-25.5 


-18.9 


-27.6 


-22.6 


-25.8 


297.3 


OH 


C6d 


-7.7 


-6.2 


-8.7 


-7.8 


-11.2 


106.3 


H 2 


c 2l) 


-4.1 


0.7 


-6.9 


-3.5 


-13.9 


232.1 


HF 




4.1 


7.7 


1.5 


4.3 


-10.7 


140.7 


Li 2 




-17.8 


-18.8 


-17.8 


-18.8 


-18.7 


24.4 


LiF 


Cq v 


8.9 


-2.4 


4.2 


-7.3 


-26.1 


138.8 


C 2 H 2 


D 6h 


17.5 


8.9 


11.7 


1.1 


-13.1 


405.3 


C2H4 




9.2 


11.1 


0.6 


1.0 


-17.3 


562.4 


C 2 H6 


D M 


1.0 


7.4 


-9.3 


-4.6 


-19.8 


710.7 


CN 


C6d 


11.5 


2.4 


8.3 


-0.3 


-8.0 


179.0 


HCN 


Cq v 


1.4 


-8.5 


-3.3 


-13.3 


-24.7 


316.3 


CO 


Cq v 


24.0 


11.3 


19.6 


7.4 


-6.1 


259.2 


HCO 


Cih 


29.1 


23.1 


23.4 


17.4 


0.8 


278.3 
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H 2 CO (formaldehyde) 


C2v 


21.0 


17.8 


14.5 


10.6 


-8.3 


373.4 


H3COH 




10.3 


15.4 


2.2 


5.7 


-13.5 


511.6 


N 2 




-13.5 


-24.9 


-15.3 


-26.2 


-32.6 


228.5 


N 2 H 4 


c 2 


-24.2 


-12.9 


-29.3 


-18.8 


-15.2 


437.8 


NO 


c 2l , 


10.3 


2.4 


8.2 


0.6 


-6.3 


152.9 


o 2 


D 6/t 


37.1 


33.6 


34.1 


30.7 


20.1 


120.4 


H 2 2 


C 2 fe 


14.7 


20.4 


9.8 


14.5 


-1.0 


268.6 


F 2 


D 6h 


28.0 


29.8 


26.5 


27.8 


13.7 


38.5 


co 2 


D 6h 


67.6 


49.0 


58.1 


40.4 


10.7 


388.9 




c 2t) 


-26.0 


-23.4 


-25.1 


-23.5 


-24.6 


151.4 


SiH 2 ( 3 5i) 


c 2l) 


-9.5 


-8.1 


-9.9 


-9.5 


-6.9 


130.7 


SiH 3 




-32.0 


-30.4 


-32.0 


-31.9 


-29.0 


226.7 


SiH 4 


Td 


-39.7 


-37.3 


-39.0 


-38.5 


-37.4 


321.4 


PH 2 


c 2t) 


-26.7 


-22.4 


-26.2 


-22.3 


-10.2 


152.8 


PH 3 


C3t, 


-38.6 


-32.0 


-38.1 


-31.8 


0.6 


242.0 


H 2 S 


c 2l , 


-18.7 


-13.0 


-19.3 


-14.0 


-14.4 


182.3 


HC1 




-4.8 


-3.7 


-5.7 


-4.9 


-8.0 


106.2 


Na 2 


D 6h 


-11.9 


-11.6 


-11.8 


-11.5 


-11.9 


16.8 


Si 2 




-2.7 


-2.5 


-3.6 


-3.6 


-1.2 


74.7 


P 2 


D6/i 


-23.1 


-22.7 


-23.6 


-23.2 


-18.7 


117.2 


s 2 


D 6h 
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cs 
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7.2 


4.8 
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so 
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22.7 


14.1 


19.3 


8.8 


125.1 


CIO 


Cq v 


15.2 


21.3 


12.8 


18.7 


9.7 


64.3 


C1F 


Cq v 


19.1 


24.3 


16.2 


21.2 


1.5 


61.4 


Si 2 H 6 


D 3rf 


-54.1 
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so 2 


C2v 
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BC1 3 
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Td 
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CH 3 CCH (propyne) 
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C3H4 (cyclopropyne) 
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CH3CHCH2 (propene) 
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C3H6 


D 3h 


C3H8 (propane) 
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Bsd 
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C4H6 (bicyclobutane) 
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C2v 


C4H10 (butane) 


C 2 ft 


C4H10 (isobutane) 


C3t, 


CsHg (spiropentane) 




CeHg (benzene) 


D 6h 
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27.9 


15.4 
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30.9 


28.5 


23.3 


280.0 


92.2 


79.3 


79.7 


67.3 


17.4 


423.0 
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24.8 


18.8 
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39.1 
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25.0 


-2.5 
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27.6 


9.8 


12.0 
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31.9 
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22.0 


-1.9 


4.6 
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1.9 


1009.0 
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24.5 
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44.6 
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57.9 
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43.9 
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1299.0 


28.4 
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-9.0 


1301.0 
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53.2 
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8.1 


1281.0 
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116.3 


90.6 


89.5 


54.2 


1362.0 



CH2F2 (difluromethylene) 
CHF3 (trifluromethane) 
CH2CI2 (trichloromethane) 
CHCI3 (chloroform) 
CH3NH2 (methylamine) 
CH 3 CN (methyl cynaide) 
CH3NO2 (nitromethane) 
CH3ONO (methyl nitrite) 
CH3SiH3 (methyl silane) 
CHOOH (formic acid) 
HCOOCH3 (methyl formate) 
CH3CONH2 (acetamide) 
C2H4NH (aziridine) 
CNCN (cyanogen) 
(CH 3 ) 2 NH (dimethylamine) 
CH3CH2NH2 (trans ethyalmine) 
CH 2 CO (ketene) 
C2H4O (oxirane) 
CH3CHO (acetaldehyde) 
HCOCOH (glyoxal) 
CH3CH2OH (ethanol) 
(CH 3 ) 2 (dimethylether) 
C2H4S (thioxirane) 
(CH 3 ) 2 SO (dimethyl sulfoxide) 
CH3CH2SH (ethanethiol) 
(CH 3 ) 2 S (dimethyl sulphide) 
CH 2 CHF (vinyl fluxide) 
CH3CH2CI (ethyl chloride) 
CH2CHCI (vinyl chloride) 
CH3CHCN (acrylonitrile) 
(CH 3 ) 2 CO (acetone) 
CH3COOH (acetic acid) 
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47.0 


47.0 
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31.0 
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Cih 


59.0 


55.5 


Cih 


47.8 


45.3 
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Cih 


45.9 


46.9 
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33.9 
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23.6 
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5.3 


15.1 


Cih 
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19.4 


25.7 


Cih 


31.0 


33.7 


Cih 


44.2 


33.3 


C2v 


47.9 


50.3 


Cih 


65.7 


62.5 
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36.9 


35.8 


-10.9 


439.0 


68.9 


62.3 


-0.4 


462.0 


17.6 


21.8 


-0.0 


370.0 


37.7 


42.2 


15.1 


344.0 


-19.5 


-13.1 


-23.1 


581.0 


9.9 


2.1 


-25.0 


615.0 


45.7 


42.1 


5.5 


603.0 


36.1 


33.1 


4.4 


601.0 


-33.5 


-30.7 


-34.8 


627.0 


40.7 


33.8 


3.8 


503.0 


45.4 


41.7 


1.3 


788.0 


29.4 


30.4 


22.9 


867.0 


-8.0 


-3.2 


-9.2 


719.0 


32.1 


9.7 


-18.2 


501.0 


-16.7 


-9.0 


-23.4 


869.0 


-13.4 


-5.4 


-20.1 


877.0 


41.6 


30.0 


-1.2 


533.0 


25.6 


27.4 


-4.5 


651.0 


21.7 


20.5 


-8.5 


677.0 


53.6 


45.3 


8.6 


636.0 


9.5 


14.0 


-12.2 


810.0 


3.3 


8.2 


-18.5 


799.0 


13.1 


18.5 


-0.0 


624.0 


0.8 


12.0 


-16.0 


853.0 


-3.7 


5.3 


-10.3 


767.0 


-8.2 


0.2 


-11.1 


766.0 


32.6 


32.5 


-2.5 


573.0 


6.8 


12.1 


-12.0 


691.0 


20.6 


22.3 


-1.2 


542.0 


29.7 


18.8 


-12.9 


761.0 


29.3 


31.1 


-6.0 


978.0 


48.9 


45.0 


0.6 


804.0 



CH 3 COF (acetyl fluride) 




71.0 


67.8 


55.1 


51.6 


2.6 


707.0 


CH3COCI (acetyl chloride) 


C\h 


57.2 


57.0 


42.8 


42.0 


3.3 


669.0 


CH3CH2CH2CI (propyl chloride) 


Cih 


32.5 


40.8 


14.6 


21.8 


-5.4 


985.0 


(CH 3 ) 2 CHOH (isopropanol) 


Cih 


35.0 


43.0 


16.1 


22.6 


-8.2 


1108.0 


CH3CH2OCH3 (methyl ethylether) 


Cih 


32.4 


40.4 


13.3 


19.4 


-7.6 


1096.0 


(CH3)3N (trimethylamine) 


Csv 


15.8 


25.6 


-3.0 


4.6 


-24.4 


1160.0 


C4H4O (furan) 


C2v 


103.1 


101.4 


82.3 


79.5 


39.7 


992.0 


C4H4S (thiophene) 


C2v 


87.3 


92.7 


66.0 


71.1 


51.6 


960.0 


C4H4NH (pyrole) 


C2v 


-15.1 


-12.6 


-35.5 


-33.9 


-48.0 


1168.0 


C5H5N (pyridine) 


C2v 


105.1 


105.4 


80.2 


80.2 


43.3 


1234.0 


HS 


Cq v 


-12.2 


-9.3 


-12.2 


-9.5 


-9.9 


87.0 


CCH (ethynyl radical) 


Cq v 


30.8 


21.8 


24.7 


15.2 


-1.7 


262.0 


CH 2 CH (vinyl radical) 


Cih 


19.4 


18.7 


11.3 


9.7 


-8.0 


443.0 


CH3CO 


Cih 


43.2 


39.3 


31.4 


27.1 


-3.5 


581.0 


CH 2 OH (hydroxymethyl) 


Ci 


20.2 


21.9 


12.9 


13.3 


-6.0 


409.0 


C1NO 


Cq v 


-42.6 


-44.5 


-48.1 


-49.6 


-70.4 


192.0 


NF 3 


Csv 


64.0 


64.3 


57.0 


57.5 


15.9 


209.0 


PF 3 


Csv 


41.0 


45.9 


29.6 


34.6 


29.4 


359.0 


3 


C2v 


50.8 


52.7 


46.7 


48.7 


34.4 


148.0 


F 2 


C2v 


50.6 


55.3 


47.2 


51.3 


30.7 


94.0 


CIF3 


Csv 


-8.2 


-6.9 


-14.0 


-12.8 


-53.8 


126.0 


C2F4 


D 2h 


146.2 


138.2 


126.5 


118.6 


36.5 


592.0 


CH3O (methoxy radical) 


Cih 


15.2 


17.2 


7.8 


8.9 


-8.9 


400.0 


CH3CH2O 


Ci 


31.0 


35.2 


17.9 


21.3 


-2.4 


696.0 


CH3S (methylsulfide radical) 


Cih 


4.2 


9.4 


-2.6 


1.7 


-7.0 


381.0 


CH3CH2 (ethyl radical) 


Cih 


8.4 


12.9 


-1.3 


1.8 


-16.9 


601.0 


(CH3)2CH (isopropanyl radical) 


Cih 


22.6 


28.7 


7.6 


12.2 


-10.9 


898.0 


(CH3)3C (isobutanyl radical) 


C^ v 


37.3 


45.2 


16.7 


22.8 


-9.4 


1195.0 


NO2 (nitrogen dioxide) 


C2v 


51.5 


41.1 


45.9 


36.2 


18.0 


228.0 


MAE: 




33.9 


33.7 


26.4 


25.4 


14.5 




ME: 




25.0 


25.6 


15.5 


15.4 


-4.8 
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TABLE III: The comparison of bond distances of selected 
molecules from the G2-148 set, calculated within different 
computational models- Ml: SR-a£A/6311G*7RIJ. M2: SR- 
OEA/DGDZVP2/A2, M3: SR-a// F /6311G**/RIJ, M4: SR- 
a#i?/DGDZVP2/A2, All distances are in A. The mean abso- 
lute error (MAE) and mean error are given in last two rows. 



The experimental bond lengths are from Ref. 
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CS 1.511 1.518 1.523 1.529 1.535 

SiO 1.501 1.507 1.510 1.516 1.510 

SO 1.487 1.486 1.498 1.496 1.481 

MAE: 0.020 0.018 0.016 0.017 - 

MEAN: -0.018 -0.004 -0.009 0.006 - 
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TABLE IV: The deviation in the first vertical ionization en- 
ergies with respect to experimental ionization potentials for 
the subsets G2 set of molecules (49 in total) within differ- 
ent computational models- Ml: SR-a£A/6311G**/RIJ. M2: 
SR-OEA/DGDZVP2/A2, M3: SR-aHF/6311G**/RIJ, M4: 
SR-a//ir/DGDZVP2/A2, All energies are in eV and are cal- 
culated at the optimized geometries of molecules in the re- 
spective model. The last column contains exact ionization 



potentials; 


these are obtained from tabulat 


ion in 


Ref. 
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